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Abstract — We consider bit interleaved coded modulation 
(BICM) receiver performance improvement based on the concept 
of generalized mutual information (GMI). Increasing achievable 
rates of BICM receiver with GMI maximization by proper scaling 
of the log likelihood ratio (LLR) is investigated. While it has been 
shown in the literature that look-up table based LLR scaling 
functions matched to each specific transmission scenario may 
provide close to optimal solutions, this method is difficult to adapt 
to time-varying channel conditions. To solve this problem, an 
online adaptive scaling factor searching algorithm is developed. 
Uniform scaling factors are applied to LLRs from different bit 
channels of each data frame by maximizing an approximate GMI 
that characterizes the transmission conditions of current data 
frame. Numerical analysis on effective achievable rates as well 
as link level simulation of realistic mobile transmission scenarios 
indicate that the proposed method is simple yet effective. 

Index Terms — BICM, generalized mutual information, LLR 
optimization 

I. Introduction 

Bit interleaved coded modulation (BICM) has been widely 
adopted in wireless standards, e.g. 3GPP High Speed Packet 
Access (HSPA) and Long-Term Evolution (LTE) Q], 0. 
For multi-level modulation schemes, BICM transmission can 
be decomposed into multiple parallel bit channels by which 
each bit channel is capable of transmitting at certain rates 
according to the bit channel's transition probability density 
function. Achievable rates of BICM can be characterized 
by the generalized mutual information (GMI) [3|, which is 
equivalent to the sum of GMI of individual bit channels. 
With the assumptions of perfect channel state information 
and optimum detection, BICM capacity has been shown to 
be achievable 0. 

In real systems, however, with imperfections in the receiver, 
e.g., limited depth interleaving, imperfect channel estimation, 
sub-optimal detection, etc., the actual achievable rate may 
be substantially lower than the BICM capacity. Such charac- 
teristics are categorized as BICM with mismatched decoding 
0. On the other hand, by certain LLR correction prior to 
decoding, the achievable rate can be improved 0. As 
shown in [5| |4|, optimal correction is to obtain LLRs that are 
consistent with the overall bit-channel transition probability 
between each coded bit and the detector output LLR. It is 
suggested in above references that LLR correction may be 
performed by off-line data analysis to first find empirical 
probability density function (PDF) of LLRs from each bit- 



channel, then numerically scale the LLRs. Specifically, a look- 
up table (LUT) with scaling factors matched to LLRs from 
each bit channel can be generated. For simpler processing, 
uniform scaling factors may be used instead of LUTs. Gen- 
erally, such off-line analysis based scaling methods are valid 
to scenarios where transmission conditions are known to the 
receiver. In typical mobile communication systems, however, 
accurate identification of transmission scenarios in real-time 
is difficult due to rapid changes in channel conditions. As 
a result, such methods may not be applicable, as applying 
mismatched scaling factors to LLRs can degrade decoder 
performance. 

In this paper, we consider an adaptive method for linear LLR 
optimization of a mismatched BICM receiver. We follow the 
GMI concept and seek to linearly optimize LLRs by scaling 
factors obtained from simple online numerical search. In this 
process, decoder decision feedback is used to calculate an ap- 
proximate GMI to search for scaling factors. As will be shown, 
this online search method provides uniform scaling factor(s) 
per bit channel that can adapt to time-varying transmission 
scenario without explicitly knowing the channel conditions. 

The paper is organized as follows. Section-|II] introduces 
the BICM system model and discusses achievable rates by 
practical receivers. Section-lHll reviews and discusses existing 
scaling methods based on off-line channel analysis. Section-ITVl 
introduces the proposed scaling algorithm. Section-fVl pres ents 
simulation results. Finally, Section-TVll concludes the paper. 

II. BICM System model and achievable rates 

We consider bit-interleaved channel-coded modulation 
transmitted over memoryless channel. Information bit se- 
quence u = [uo,ui, ■ ■ ■ , uk-i] is encoded to produce coded 
bit sequence c = [en, c\, ■ ■ ■ , Ck/r-x], where R is code rate. 
Proper interleaving and possibly scrambling of c produces a 
new bit sequence b = [&o, &].,■■• ^k/r-i\> which is then 
mapped into symbols from a M-ary signal constellation x> 
producing x = [xq,x\,--- , iejv-i], where x n G \- F° r sim- 
plicity, we consider quadrature amplitude modulation (QAM), 
but extension to other modulation schemes is straightfor- 
ward. The modulated signals are transmitted over the wireless 
channel via single or multiple antennas after layer mapping 
and precoding. At the receiver side, following detection of 
received signal, y = [yo,yi, ■ • • , Un-i], a general process of 
descrambling, rate de-matching, and de-interleaving recovers 



the LLR sequence to match that of the coded bits. The channel 
decoder then takes these LLRs as input. A generic description 
of such a process is shown in Fig. Q] 



achievable data rates. By (O it has been shown that the max- 
imum achievable rate is the generalized mutual information 
(GMI) defined as 
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Fig. 1 . A simplified block diagram of BICM receiver 

The total BICM channel may be decomposed into m — 
log 2 M parallel binary-input channels. Note that for square 
QAM with Gray mapping, bit loading in the real and imagi- 
nary dimensions of the complex signal are identical. Therefore, 
we only need to consider m/2 distinct bit channels. 

A. Achievable rates of BICM with matched and mismatched 
decoding metrics 

The BICM capacity is given by 1 3 ] 
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where I(Bj;Y) is the mutual information between the random 
variable of transmitted bit through j-th bit channel, Bj, and the 
received signal Y, p Bi (b) is the probability of bit b transmitted 
through the i-th bit channel, and py\Bi (Y\b) is the conditional 
PDF of receiving Y given b is transmitted. The above rate is 
achievable from the argument of random coding with typical 
sequences. Note that ([TJ indicates that the capacity of BICM 
systems is the sum of capacities of the modeled individual 
bit channels. With perfect interleaving and optimal detection, 
the detector output LLRs are sufficient statistics of transition 
probabilities Py\x{d\x)- For optimal detection, the definition 
of LLR is 
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In real systems, however, decoding metrics in form of 
detection output LLRs, are generally mismatched to the true 
channel transition probabilities Py\Bi (y\bi). Another decoding 
metric is introduced that works as an estimate of the bit 
channel transition probability 



qB,,Y{bi{x),y) =p Y \ Bi (y\bi) 
and the available LLR is therefore an expression of 
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Decoding with mismatched metric qB u Y{bi{x),y) instead 
of p Y \Bi(y\bi) causes performance degradation in terms of 
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where LB i: Y is the detector output LLR that will be input to 
the channel decoder. 

The above shows that the GMI is the sum of I qB Y (s) of 
all bit channels. I qB . Y (s) is defined as I-curve of the i-th 
bit channel with regard to variable s, while s that maximizes 
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(s) is called the critical point. Furthermore, it is shown 
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B. Iterative decoding and critical point 

It is well known that iterative decoding with the LogMAP 
algorithm for turbo codes (or the sum-product algorithm 
for LDPC codes) is approximately a maximum likelihood 
(ML) decoding solution when channel LLRs are optimized. 
However, while a non-iterative ML decoder does not need to 
restrict the critical point's value, iterative LogMAP decoding 
requires the critical point of the total I-curve to be one in order 
to perform well. On the other hand, iterative Max-LogMAP 
decoding will not be affected by changing the critical point. A 
shift of the total I-curve's critical point therefore has an effect 
equivalent to a fixed SNR mismatch. For best performance 
by Max-LogMAP for turbo codes (or min-sum algorithm for 
LDPC codes), it is sufficient to align critical points of bit 
channel I-curves so that they become harmoni^. For LogMAP 
decoding, however, we need to not only align critical points to 
be the same, but also align them at one. Note that if optimal 
MAP detection is applied, it will produce harmonic I-curves 
with all critical points achieved at one. 

III. Off-line based scaling functions for LLR 

OPTIMIZATION 

In recent literature, LLR scaling functions for GMI max- 
imization based on off-line analysis are presented 0. 
Below we review the processes of finding scaling factors 
followed by a discussion on the connection between the GMI 
and the consistency condition. 

'Two I-curves sharing the same critical point is called harmonic (4)- 



A. Optimal scaling based on off-line histogram analysis 

By knowledge of the given transmission scenario, two con- 
ditional PDFs can be obtained by off-line histogram estimator 
for LLRs from each bit channel. Denote pz, (Z j 6 = 1) and 
Ph{l\b = 0) as conditional PDFs of generating LLR I when 
bit b = 1 and b = are transmitted, respectively. The optimal 
scaling factor for I is 
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For simple implementation, a LUT may be generated so that 
s(l) is approximated by a piece-wise linear function and the 
two coefficients of each linear segment are stored for a certain 
range of I values. In summary, this method is carried out in 
the following steps for each bit channel: 

1) Collect a sufficient amount of data of transmitted bits 
and detector generated LLRs of i-th bit channel. 

2) Calculate pL(l\b = 1) and Ph{l\b = 0) by histograms. 



in 



P L (l\b = l) 



3) Obtain ideal scaling function Sl (l) = PL< / |tl=Q) . 

4) Approximate Si(l) by piece-wise linear function Si(l) 
and store linear function coefficients in a LUT. 

5) In real receiver processing, after detector generates chan- 
nel LLRs Z's, apply a linear scaling function on each I 
according the LUT, then continue with decoding. 

B. Uniform LLR scaling based on off-line GMI maximization 

Uniform scaling of LLRs of each bit channel corresponds to 
a shifting of the I-curve's critical point without changing the 
peak value. By aligning I-curves from different bit channels to 
be harmonic, total GMI of BICM receiver may be increased. 
The following steps summarize the process of how to apply 
linear scaling: 

1) Collect a sufficient amount of data of transmitted bits 
and detector generated LLRs of each bit channel. 

2) Calculate and plot the I-curve I qB . Y (s) as given in 

3) Obtain a scaling factor s$ that maximizes I qB Y (s). 

4) In real receiver processing, after detector generates chan- 
nel LLRs Z's, apply scaling factor Si on each I generated 
from i-th bit channel, then continue with decoding. 

For illustration, I-curves of each bit channels obtained 
from link level simulation of LTE user equipment (UE) 
receiver are plotted in upper part of Fig. The simulated 
channel profile is extended typical urban (ETU) channel. 
Channel model parameters include excess tap delay 
[0, 50, 120, 200, 230, 500, 1600, 2300, 5000]ns with relative 
power [-1.0, -1.0, -1.0, 0.0, 0.0, 0.0, -3.0, -5.0, -7.0]dB, 
and Doppler frequency 300Hz |6). Transmission scheme is 
MCS17 (64-QAM with code rate 0.4). As can be seen, 
critical points of each bit channel's I-curve do not align 
at the same value, linear scaling of LLRs can therefore 
increase total GMI by aligning 3 I-curves at the same s. For 
comparison, I-curves of the total BICM channel generated 
from detector output LLRs without any processing, LLRs 
after above mentioned LUT based scaling and uniform 
scaling, are plotted in lower part of Fig. [2] Clearly, LUT 



based scaling increases GMI more than uniform scaling does. 
However, both scaling methods result in the critical point 
achieved at s = 1. 
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Fig. 2. Top: Bit channel I-curves without LLR scaling. Bottom: total I-curves 
before and after scaled by scalar functions found off-line and online 



C. Consistency condition and GMI maximization 

Below we discuss the connection between GMI maximiza- 
tion and the consistency condition. The consistency condition 
can be defined for the random variable L as [7| [8| 
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p L (l\b = 0) 

While L opt (i.e., LLR generated from a true bit channel 
transition probability) satisfies ((9), L generated from 
generally does not. For example, in turbo decoding, consider 
BCJR decoding of a constituent convolutional code. As can 
be proven, optimal demodulation/detection generates output 
LLRs that satisfy consistency condition, which in turn will 
allow the BCJR decoder to achieve maximum a posteriori 
(MAP) decoding for the constituent code. In comparison, 
suboptimal demodulation/detection introduces two types of 



TABLE I 

Scaling factors by GMI and averaging by consistency 

CONDITION 



method/channel 


bit-0,1 


bit-2,3 


bit-4,5 


total 


a' by GMI 


1.46 


1.39 


1.04 


1.33 


s' = E L {s(l)} 


1.40 


1.45 


1.04 


1.32 



imperfection. Type-I: it produces less reliable estimates of 
transmitted coded bits, which can be modeled as a worse bit 
channel; Type-II it may violate the consistency condition, 
which can further degrade decoding performance due to the 
LLR's inaccurate representation of bit probabilities. Correction 
of mismatched LLRs does not improve the bit channel, but 
can correct/reduce the inaccuracy of bit probabilities contained 
in the LLR values. In other words, potential performance 
improvement is from reducing negative effects of Type-II 
but not of Type-I. 

Regarding optimal scaling function that maximizes GMI 



(i.e., s(Z) 
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as shown in J5) J4]), we see it is 



equivalent to process the LLRs such that the scaled LLRs 
satisfy the consistency condition. For optimal uniform scaling 
factors, we conjecture that the uniform scaling per bit channel, 
that aligns each bit channels to be harmonic at s = 1, is equal 
to the mean value of the optimal scaling function as 



a' = E L {s(l)} = E L 



In 
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In other words, when uniform scaling is considered, the 
mean value from scaling function satisfying the consistency 
condition not only maximizes GMI but also aligns I-curves 
of bit channels to be harmonic at s = 1. Although proof of 
this conjecture seems difficult, numerically we have found it is 
supported. For example, we test max-LogMAP demodulation 
of 64 — QAAI signals over fast Rayleigh fading channel at 
SNR= 7dB, uniform scaling factors found by using method 
as Section lHI-Bl and by using s' = El{s(1)} give us the results 
as in TABLE. [Q 

IV. Decoder decision-aided online LLR scaling 

FOR GMI MAXIMIZATION 

Due to different statistical characteristics of LLRs obtained 
from different transmission scenarios, LUTs obtained in one 
case may be substantially different from the desirable ones 
for another case. The same problem occurs for the mean 
scaling factors obtained off-line. Therefore it is desirable to 
obtain scaling functions adaptive to the channel, transmission 
scheme, and other variations that the receiver may experience. 
Below we consider online uniform scaling based on GMI 
maximization. By adopting this approach, we may anticipate 
performance similar to previous off-line uniform scaling per 
bit channel based on GMI maximization but with real-time 
adaptation to channel conditions. 
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Fig. 3. Receiver scheme flowchart. 

A. Uniform scaling per bit channel 

To enable online calculation of the scaling factor for each 
transport block or code block, we can again apply decoder 
hard decisions to provide estimates of bi(X), bi(X). As a 
result, we have an approximation of © for each transport 
block that characterizes current transmission conditions. For 
this purpose, we can calculate an approximate I-curve function 
for any specific s as 

IqBi,Y{s) 

= 1 - E x .y {log 2 (l + exp(- sgn{bi(X))L BitY s)) } 

1 Ni-l 

~ 1 - tt log 2 f 1 + CX P( - sgn(S;, n )Li,„s) 



i=0 



(ID 



where Ni is the total number of bits transmitted through the 
z-th bit channel. The optimal scaling factor for the i-th bit 
channel is therefore 



Si 



arg max La ,y(s) 

s>0 



(12) 



Based on ( fl2b , the flowchart of the proposed receiver 
scheme after detection is shown in Fig. [3] 

In practice, to identify an approximately maximized GMI, 
we may search for s that maximizes I q Bi,Y(s)- Note that 
I q Bi,Y(s) is convex with regard to s. To search for the 



optimal s, we can initially set s = 1, and a multiplicative 
coefficient a > 1, e.g., a = 1.05, then compare I q Bi,Y( s ) w i tn 
iqBi.y^s) or l q Bi ,y(s/a) to determine whether to increase 
or to decrease s, then recursively set s — as or s = s/a until 
the peak of the convex curve by i g s ii y(s) is identified. The 
search process is summarized in the following 4 steps: 
Initialization: set s = 1, a = 1.05. 

1) Calculate I q Bi,Y{s) and I qBi! Y(ots) according to (fTTT i. 



2) If I qBi ,Y(s) > I qBl ,Y{as), set a = 
IqBi,Y{s) = I q Bi,Y(as), set s = as 



l/a. Otherwise: let 

> i q Bi,Y{as), set 



3) Calculate / g s i ,y(as). If / g s i ,y(s) 
Si = (s + as) 1 2. Stop. 

4) Otherwise, let j g s i ,y(s) = / 9 s i ,y(as), set s = as. Go 
back to Step 3. 

For verification, we numerically calculate the I-curve ob- 
tained from the above scaling factor search algorithm for 
MCS17 over ETU300 with real channel and noise variance 
estimation. Estimation in the link level simulator is based on 
frequency domain minimum mean square error (MMSE) esti- 
mation using pilot symbols. It is then compared with I-curves 
from LLRs without scaling, off-line LUT based scaling, and 
off-line uniform scaling. Fig. [2] verifies that uniform scaling 
based on online GMI maximization can slightly increase GMI, 
while at the same time maximizes the I-curve approximately 
at s = 1. 

B. Uniform scaling of positive and negative LLRs separately 
per bit channel 

For 16-QAM and 64-QAM, due to bit loading by Gray- 
mapping and max-log type detection, optimal scaling functions 
by (O for LLRs from the first bit channel are symmetric 
around 0, but not so for LLRs from the second and/or the 
third bit channels. For the second and third bit channels, 
it becomes reasonable to search for one scaling factor for 
positive LLRs and another for negative LLRs. For verification, 
we plot I-curve obtained by applying different uniform scaling 
for positive and negative LLRs separately, and then compare 
with the results we obtained in Fig. [2] We denote the new 
scaling method as "2-level scaling via online GMI". The 
results show a substantial increase in GMI is obtained by 2- 
level scaling and online GMI maximization. 

C. Accuracy of scaling factors by decision feedback 

We next study the deviation of scaling factors found by 
decoder decision feedback from "genie-aided" scaling factors 
that can be obtained assuming true transmitted bits are avail- 
able. To quantify the extent of errors incurred by decoder 
decisions, we define a normalized mean error as E{ s ~ s } 
where s is the desired scaling factor and s is the actual 
available scaling factor. Numerical comparisons are carried 
out for the MCS17 over ETU300 channel at various SNRs. 
In Fig. |U the top figure shows the mean values of the "genie" 
scaling factors at different SNRs, while the bottom one plots 
the normalized mean error of the actual scaling factors. In 
these figures there is only one curve for bit channel 0,1 
because by Gray-mapping this channel's transition probability 



is symmetric for positive and negative LLRs. We make the 
following remarks based on Fig. [4] Firstly, the mean value of 
the scaling factor of each bit channel gradually decreases and 
approaches one. This is consistent with the fact that at very 
high SNR, channel estimation becomes more reliable while 
the output by sub-optimal MLM detection also approaches that 
by optimal MAP detection. Secondly, overall the reliability of 
scaling factors from online decisions become more reliable 
as SNR increases, while the convergence rate toward "genie" 
decision-aided results varies by each bit channel and the sign 
of the LLRs. 

ETU300 2x2, MCS17 
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Fig. 4. Top: Mean scaling factors of each bit channel. Bottom: Normalized 
mean error of scaling factors by decoder decisions 



V. Simulations 

We test the performance of the proposed scaling method 
under realistic channel conditions for an LTE system. Simu- 
lations are based on the link level simulator of LTE downlink 
2x2 MIMO channel with open loop spatial multiplexing 
transmission. The system occupies 10 MHz bandwidth using 
frequency division duplexing (FDD). Tested channel profiles 
include Extended Vehicular A (EVA) and Extended Typical 
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Fig. 5. FER: EVA70 channel, MCS10, number of physical resource block 
= 50, Bandwidth lOMhz 

Urban (ETU) channels (6). Modulation and coding sets (MCS) 
include MCS10 and MCS 17, which correspond to 16-QAM 
and 64-QAM modulation and code rates around 0.3 and 
0.4, respectively. At the receiver, a max-log type detector is 
applied. The channel decoder can be either LogMAP (LM) 
or scaled max-LogMAP (S-MLM) decoder. The maximum 
number of iterations performed by turbo decoding is 8. For the 
S-MLM decoder, extrinsic LLRs of each constituent decoder 
are scaled by 0.7 before being used as a priori LLR by 
the other constituent decoder in order to improve decoding 
performance J9). As described in Fig. [3] in order to apply 
the proposed LLR scaling, one initial iteration of S-MLM 
decoding is applied. After LLR scaling, a maximum of 7 
iterations is continued with either LM or S-MLM decoding. 
Frame error rate (FER) comparisons include decoding with 
LM or S-MLM algorithms without scaling LLRs, and de- 
coding with the proposed scaling method. Fig. |5]|6] show the 
results. Without scaling, LM decoding performs usually worse 
than S-MLM decoding due to LM's sensitivity to imperfect 
input LLRs. However, with proper scaling, while S-MLM 
decoding's performance improves slightly, LM decoding's 
performance is improved significantly. This is consistent with 
the observation of I-curves in Fig. [2] where LLR scaling not 
only increases total GMI but also aligns the critical point at 
one. As expected, after LLR scaling, LM decoding outper- 
forms S-MLM decoding substantially. From above cases, LLR 
scaling provides more gains over the ETU300 channel, which 
indicates that the proposed method is more effective when 
transmission conditions are more severe. 

VI. Conclusions 

As the next generation mobile communication systems are 
expected to provide high throughput at expanded coverage and 
in versatile environments, receiver adaptation to the changing 
environment is important for system performance. In this 
work, based on the relationship between the BICM achievable 
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Fig. 6. FER: ETU300 channel, MCS 17, number of physical resource block 
= 50, Bandwidth lOMhz 

rates and the GMI, we introduce a simple and effective method 
of LLR optimization to improve receiver performance. In order 
to perform LLR scaling that is adaptive to various transmission 
conditions, an online GMI maximization based scaling factor 
searching algorithm is developed. Specifically, an approximate 
I-curve function is calculated by using decoder decisions after 
certain initial decoding iteration(s), followed by a numerical 
search of the scaling factor that aligns the critical point of 
each bit channel's I-curve at one. While the effectiveness of 
this method depends on decoder decisions' reliability, we note 
that in practical operating SNR regions, the decoder decisions 
provide reasonable accuracy in determining the scaling factors. 
Increases in GMI by the proposed method are confirmed 
by numerical analysis. Performance improvement in realistic 
systems is further verified in link level simulation of LTE 
downlink transmissions. 
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